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[57] ABSTRACT 

Audio quality perception can be improved in an audio 
compression system 100 by defining a frequency subband 
threshold 205. With the frequency subband threshold 205, 
each frequency subband favorable to the threshold will be 
allocated at least a predetermined number of bits while 
frequency subbands unfavorable to the threshold will 
receive no bit allocations. This determination is made on an 
audio frame by audio frame basis, or upon change of the 
audio compression ratio, since changing any of the param- 
eters: bitrate, sampling rate and coding mode changes the 
compression ratio. 

6 Claims, 3 Drawing Sheets 
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BIT ALLOCATION METHOD FOR 
IMPROVED AUDIO QUALITY PERCEPTION 
USING PSYCHOACOUSTIC PARAMETERS 

This is a continuation of application Ser. No. 08/207,995. 
filed Mar. 9, 1994 and now abandoned. 

FIELD OF THE INVENTION 

This invention relates generally to the field of audio 
compression, and, in particular, to improved audio quality 
perception. 

BACKGROUND OF THE INVENTION 

Communication systems are known to include a plurality 
of communication devices and communication channels, 
which provide the communication medium for the commu- 
nication devices. To increase the efficiency of the commu- 
nication system, audio that needs to be communicated is 
digitally compressed. The digital compression reduces the 
number of bits needed to represent the audio while main- 
taining perceptual quality of the audio. The reduction in bits 
allows more efficient use of channel bandwidth and reduces 
storage requirements. To achieve audio compression, each 
communication device may include an encoder and a 
decoder. The encoder allows the communication device to 
compress audio before transmission over a communication 
channel. The decoder enables the communication device to 
receive compressed audio from a communication channel 
and render it audible. Communication devices that may use 
digital audio compression include high definition television 
transmitters and receivers, cable television transmitters and 
receivers, portable radios, and cellular telephones, 

One type of audio encoder has a fixed target bit rate for 
the compressed audio information. The target bit rate is fixed 
by the desired storage media, the desired communication 
channel, and/or the desired level of audio quality at the 
decoder. Since the target bit rate is fixed, the number of bits 
available to an audio frame is also fixed. Another type of 
audio encoder is a subband encoder. Subband encoders 
divide the frequency spectrum of the signal to be encoded 
into several distinct subbands. The magnitude of the signal 
in a particular subband may be used in compressing the 
signal 

An exemplary prior art fixed-bit rate subband audio 
encoder is the International Standards Organization Interna- 
tional Electrotechnical Committee (ISO/IEC) 11172-3 inter- 
national standard, hereinafter referred to as MPEG (Moving 
Picture Experts Group) audio. MPEG audio allows bits to be 
assigned to each subband within an audio frame. On a frame 
by frame basis, the number of subbands that actually are 
allocated bits varies. This variation in bit allocations may 
produce an audible artifact similar to a chirp. One method 
for eliminating the audible artifact is to allocate a sufficient 
number of bits to each subband. While this approach works 
well for high bit rates, for low bit rates, this is impossible. 

Therefore, a need exists for a method and apparatus that 
eliminates the audio artifact in fixed-bit rate subband audio 
compression systems such as MPEG audio. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates an audio compression system that incor- 
porates the present invention. 

FIG. 2 illustrates an exemplary bit allocation process in 
accordance with the present invention. 

FIG. 3 illustrates a flow diagram of a method that may be 
used to implement the present invention. 
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FIG. 4 illustrates preferred frequency subband thresholds 
for corresponding output bit rates and input sampling rates 
for use in accordance with the present invention, 

5 DESCRIPTION OF A PREFERRED 

EMBODIMENT 

Generally the present invention provides a method and 
apparatus that improves audio quality perception. This is 
accomplished by determining a set of frequency subbands. 

10 This set of frequency subbands is a subset of the subbands 
available in an audio frame. For each audio frame, bits are 
allocated to the frequency subbands in the set based on 
psychoacoustic parameters. During the bit allocation 
process, the process ensures that at least a predetermined 

15 number of bits are allocated to each frequency subband in 
the set of frequency subbands. With such a method and 
apparatus, the audio artifacts of the prior art are substantially 
eliminated thus producing improved audio quality percep- 
tion. 

The present invention can be more fully described with 
reference to FIGS. 1-4. FIG. 1 illustrates an audio compres- 
sion system 100 that receives an audio frame, such as a 
frame of pulse code modulated (PCM) audio 101. The frame 

25 of PCM audio 101 is analyzed by a filterbank 102 and a 
psychoacoustic model 103. The filter bank 102 outputs a 
frequency domain representation of the frame of audio for 
several frequency subbands. The psychoacoustic model 103 
determines in conjunction with the bit allocation element 

30 104, the number of bits that should be allocated to each 
frequency subband based on the perception of the human 
ear. Each bit is iteratively allocated and the psychoacoustic 
parameters are re-analyzed after each iteration. The final bit 
allocation is outputted to the quantizer 105 by the bit 

35 allocation element 104. The quantizer 105 compresses the 
output of the interbank 102 to correspond to the final bit 
allocation. The bit stream formatter 106 takes the com- 
pressed audio from the quantizer and adds any header or 
additional information and formats it into a bit stream 107. 

40 The PCM audio 101 is typically a linearly quantized 
representation of an audio signal. The sample size varies 
depending on the requirements of the application, typically 
in the range of 8 to 24 bits. Compact discs and digital audio 
tape use 16 bits. The sampling frequency also varies, typi- 

45 cally in the range of 8 to 48 kHz. Compact discs use 44. 1 
kHz. In addition, multiple channels may be interleaved in 
the PCM audio signal. Compact disks and digital audio tapes 
use two channels. So for an example compact disc-based 
audio compression system, the input PCM audio is 16 bits 

50 per sample, 44. 1 kHz sampling frequency, and two channels, 
This yields an input bit rate of about 1.5 megabits per 
second. 

The filterbank 102 which may be implemented in accor- 
dance with MPEG audio by a digital signal processor such 

55 as the MOTOROLA DSP56002, transforms the input time 
domain audio samples into a frequency domain representa- 
tion. Filterbank 102 uses a small number (2-32) of linear 
frequency divisions of the original audio spectrum to rep- 
resent the audio signal. The filterbank 102 outputs the same 

60 number of samples that were input and is therefore said to 
critically sample the signal. The filterbank 102 critically 
samples and outputs N subband samples for every N input 
time domain samples. 
The psychoacoustic model 103. which may be imple- 

63 mented in accordance with MPEG audio by a digital signal 
processor such as the MOTOROLA DSP56002. analyzes the 
signal strength and masking level in each of the frequency 
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subbands It outputs a signal-to-mask ratio (SMR) value for each audio frame in the plurality of audio frames, could be 

each subband The SMR value represents the relative sen- PCM audio, and includes a plurality of frequency subbands. 

sitivity of the human ear to that subband for the given Typically, in MPEG audio each audio frame includes 32 

analyzed period. The higher the SMR. the more sensitive the frequency subbands. However, in different applications, the 

human ear is to noise in that subband, and consequently, 5 number of frequency subbands may vary, 

more bits should be allocated to it. Compression is achieved Depending upon whether bit allocation alternative #1 202 

by allocating fewer bits to the subbands with the lower orbit allocation alternative #2 206 is used, the process either 

SMR. to which the human ear is less sensitive. determines a set of frequency subbands or a frequency 

The bit allocation element 104. which may be imple- subband threshold 301. When using alternative #1 202, the 

raentcd with a digital signal processor such as the "> set of frequency subbands is based on the selected audio 

MOTOROLA DSP56002. uses the SMR information from compression ratio. The audio compression ratio is based on 

the psychoacoustic model 103, the desired compression an input sampling rate of each audio frame and an output bit 

ratio and other bit allocation parameters to generate a rate of the audio compression system. As examples of the 

complete table of bit allocation per subband. The function of input sampling rate and output bit rate, an audio compres- 

the bit allocation element 104. will be described below with * 5 sion system's sampling rate may be from 8 to 48 kHz while 

reference to FIG 3 tne output bit rate may be 32 to 448 kilobits per second. As 

The quantizer 105 which may be implemented in accor- described above, the higher the bit rate, the more bits are 

dan* ' fflSraG auZoby Tdigital^nal processor such available for bit ^location. Thus, the .fewer the bits available 

.7 SniTnQPwn? the hit allocation for aUocation. the smaller the set of frequency subbands or 

specified nun**, of bits'! Various ^* Z^lZT^T^r^n 

bv processing the scaled subband sample through a linear below that number are included while frequencies above .that 
oy processing u«= "tr »" 25 numbcr ^ excluded. For example, if a compression ratio of 

The bit stream formatter 1W which may be implemented "» ' 
in accordance with MPEG audio by a digital signal proces- 30 ' . ,.„... „ . . „ 
sor such as the MOTOROLA DSP56002, takes the quan- Having set the frequency subband thresho d <x ^having 
tized subband samples from the quantizer 105 and packs determined the set of frequency subbands, a predetermined 
^TtoZ^ctrnW^vnthhctteMtxartio*. number <* bits are aUocated to each frequency subband in 
bit allocation information, scale factor informatioa. and any the set or that is favorable to the frequency subband thresh- 
er s^ Information the coder requires. The bit stream is „ old 302. As mentioned above theprede ™^ "tunber of 
output at a rate equal to the PCM audio 101 input bit rate bits uybeoieor any number Aat the system can readdy 
^T. ' '„ ' „tfr, sustaio. With an initial tut allocation, the audio compression 
divided by the compression ratto ^ ^ biK ^ a ^ haye 

The presen invention elimuiates fce audio artifad t by ^ exhausted30 3. When bits are remaining to be allocated 

d ^^^,^1?f2 ! 3l within the bit set 303. the remaining bits are aUocated. on a 

frequency subbands. As illustrated in FIG. 2. the audio «, in me sct „ to subballds 

compression system 100 receive, ; a pluraUty - of ^frames ? ^ Qa choacoustic 

200. Each audio frame 201 includes a plurality of frequency f ft ^ j^s process repeats until 

subbands 203. 207. Each of the frequency subbands has a S!wT^!« «h.«^A 

corresponding number of allocated bits 204. 208. As shown the bit set .s exhausted 303. 

in aUocation alternative #1 202. the frequency subband 43 Having exhausted the bit set the audio compression 

threshold 205 is utilized. In mis alternative 202. frequency system then goes to the next frame 305 and subsequently 

subbands favorable to. or below, the frequency subband determines whether a change in audio compression ratio has 

threshold 205 are aUocated bits while frequency subbands occurred 306. If a change in audio compression ratio has 

above the threshold are not. To ensure elimination of the occurred, the process repeats at step 301. Continuing with 

audio artifact, each frequency subband below the frequency x the example above, where a compression rauo of 8 to 1 was 

threshold 205 must have at least a predetermined number of selected, if the compression ratio were changed to 12 to , 1. 

bits aUocated to it. This predetermined number of bits may the frequency subband threshold would be 9 such jtat 

be one or any other number that the system can support The frequency subbands 1-9 would be aUocated bits while 

process in which the bits are aUocated to the frequency frequency subbands 10-32 would receive no bit allocations, 

subbands wiU be described with reference to FIG. 3 below. S5 If no change in audio compression ratio has occurred 306. 

FIG 2 also Ulustrates an alternative bit aUocation 206. the processmay proceed in one of two directions. If path 307 

which shows a set of frequency subbands 209. In this is chosen, the process repeats at step 302. However, if path 

alternative 206. each frequency subband in the set of fre- 308 is chosen, the set of frequency subbands or frequency 

quency subbands 209 are aUocated at least a predetermined subband threshold is again determined. If this path 308 is 

number of bits. Note that the set of frequency subbands can eo chosen, me sel of frequency subbands or the frequency 

be a contiguous set or a non contiguous set as shown in FIG. subband threshold is being determined on a frame by frame 

2 Again the bit aUocation wfll be described with reference basis. When this path is chosen, the frame by frame basis 

to FIG 3 may change from using the set in one frame to the threshold 

FIG.' 3 Ulustrates a flow diagram that may be used by the in another frame, and vise versa, 

audio compression system 100 to implement the present «5 FIG. 4 Ulustrates a table of corresponding output bit rates 

invention. At step 300. the audio compression system 400 and input sampling rates 401. For each input sampling 

receives a pluraUty of audio frames. As described above. rate 401. a stereo column 403 and a mono column 404 are 
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shown. The stereo column 403 indicates that the audio input 
includes a left and right channel. The mono column 404 
indicates that the audio input has a single channel. For each 
column of input sampling rate and corresponding bit rate, a 
number is shown indicating either the number of frequency 5 
subbands in the set of frequency subbands. or the frequency 
subband threshold 402. As shown in the table, at times, the 
set or frequency threshold 402 is shown to be a minus 1. In 
this instance, the present invention is not needed, in that, the 
output bit rate is sufficiently high to eliminate the audio 10 
artifact. When the set or frequency subband threshold 402 is 
not a minus 1. the number is representative of the subbands 
either within the set, or of the frequency subband threshold. 

The present invention provides a method and apparatus 
for improving audio quality perception. With such a method, 15 
the audio artifact produced in prior art systems which 
resulted from a varying number of frequency subbands 
receiving bit allocation, has been eliminated. With the 
elimination of the audio artifact, the audio quality as per- 
ceived by humans is enhanced. 20 

We claim: 

1. A method for bit allocation that improves audio quality 
perception in an audio compression system, wherein the 
audio compression system receives a plurality of audio 
frames, and wherein each audio frame of the plurality of 25 
audio frames includes a plurality of frequency subbands. the 
method comprising the steps of: 

a) determining a set of frequency subbands from the 
plurality of frequency subbands based on a selected 
audio compression ratio, wherein the set of frequency 30 
subbands is a subset of the plurality of subbands; and 

b) for each audio frame of the plurality of audio frames, 
allocating, based on psychoacoustic parameters of the 
each audio frame, bits of a bit set to frequency sub- 35 
bands in the set of frequency subbands until the bit set 

is exhausted, wherein at least a predetermined number 
of bits is allocated to each frequency subband in the set 
of frequency subbands wherein step (a) further com- 
prises: ^ 
defining the selected audio compression ratio to be a 
function of an input sampling rate of each audio 
frame of the plurality of audio frames and an output 
bit rate of the audio compression system; 
determining the set of frequency subbands on a frame 45 

by frame basis; and 
determining the set of frequency subbands when the 

selected audio compression ratio changes, 
wherein each frequency subband within a predeter- 
mined frequency subband threshold is allocated at ^ 
least a predetermined number of bits and frequency 
subbands not within to the predetermined frequency 
subband threshold fail to receive bit allocations. 

2. In the method of claim 1. step (b) further comprises 
defining the predetermined number of bits to be one. 55 

3. In the method of claim 1. step (a) further comprises 
defining the set of frequency subbands to be a contiguous 
subset of the plurality of frequency subbands. 

4. A method for bit allocation that improves audio quality 
perception in an audio compression system, wherein the M 
audio compression system receives a plurality of audio 
frames, wherein each audio frame of the plurality of audio 
frames includes a plurality of frequency subbands. and 
wherein the plurality of frequency subbands is sequentially 
numbered, the method comprising the steps of: ^ 

a) determining a frequency subband threshold based on a 
selected audio compression ratio; and 
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b) for each audio frame of the plurality of audio frames, 
allocating, based on psychoacoustic parameters of the 
each audio frame, bits of a bit set to frequency sub- 
bands of the plurality of frequency subbands having a 
sequential number favorable to the frequency subband 
threshold until the bit set is exhausted, wherein at least 
a predetermined number of bits is allocated to each 
frequency subband of the plurality of frequency sub- 
bands having a sequential number favorable to the 
frequency subband threshold wherein step (a) further 
comprises: 

defining the selected audio compression ratio to be a 
function of an input sampling rate of each audio 
frame of the plurality of audio frames and an output 
bit rate of the audio compression system; 

determining the frequency subband threshold on a 
frame by frame basis; and 

determining the frequency subband threshold when the 
selected audio compression ratio changes. 

wherein each frequency subband within a predeter- 
mined frequency subband threshold is allocated at 
least a predetermined number of bits and frequency 
subbands not within to the predetermined frequency 
subband threshold fail to receive bit allocations. 

5. In the method of claim 4, step (b) further comprises 
defining the predetermined number of bits to be one. 

6. An audio compression system that receives an audio 
frame of a plurality of audio frames, wherein the audio 
frame includes a plurality of frequency subbands. the audio 
compression system comprises: 

a filterbank that receives the audio frame; 

a psychoacoustic model that receives the audio frame, 
wherein the psychoacoustic model determines psy- 
choacoustic parameters of the audio frame; 

a quantizer that is operably coupled to the fUterbank. 
wherein the quantizer compresses the audio frame 
based on bit allocation information; and 

a bit allocation element that is operably coupled to the 
psychoacoustic model, wherein the bit allocation ele- 
ment provides the bit allocation information to the 
quantizer, and wherein the bit allocation element: 

a) determines a set of frequency subbands from the 
plurality of frequency subbands based on a selected 
audio compression ratio, wherein the set of fre- 
quency subbands is a subset of the plurality of 
frequency subbands, defines the selected audio com- 
pression ratio to be a function of an input sampling 
rate of each audio frame of the plurality of audio 
frames and an output bit rate of the audio compres- 
sion system, determines the set of frequency sub- 
bands on a frame by frame basis; and determines the 
set of frequency subbands when the selected audio 
compression ratio changes; and 

b) for each audio frame of the plurality of audio frames, 
allocates, based on psychoacoustic parameters of the 
each audio frame, bits of a bit set to frequency 
subbands in the set of frequency subbands until the 
bit set is exhausted, wherein at least a predetermined 
number of bits is allocated to each frequency sub- 
band in the set of frequency subbands. 

wherein each frequency subband within a predetermined 
frequency subband threshold is allocated at least a 
predetermined number of bits and frequency subbands 
not within to the predetermined frequency subband 
threshold fail to receive bit allocations. 

♦ * * * * 
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